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ARtlFICIAL TRANSCRIPTION FACTORS 

rrnss-Referpnrp. tn Relate d Applications 

ms patent is a continuation-in-part of United States Provisional Patent AppUcation 
Serial Number 60/388.055, filed June 11, 2002, the disclosure of which is incorporated herein 
by reference. 

TRr.hnical Fi ^lH the Invention 

The field of this invention is gene transcription. More particularly, this invention 
provides a gene transcription regulating polypeptide that contains a pluraUty of DNA binding 
domains directed to different target nucleotide sequences within one or more genes. 

Rackgrounf ^ Invention 

The construction of artificial transcription factors has been of great interest in the past 
years Gene expression can be specifically regulated by polydactyl zinc finger proteins fiised 
to regulatory domains (See, e.g.. United States Patent Numbers 6.242,568; 6,140.466; and 
6,140.081, the disclosures of which are incorporated herein by reference). 

Zinc finger domains of the Cys,-His, family have been most promising for the 
construction of artificial transcription factors due to their modular stmcture. Each domain 
consists of approximately 30 amino acids and folds into aPPa stmcture stabilized by 
hydrophobic interactions and chelation of a zinc ion by the conserved Cys,-His, residues. To 
date the best characterized protein of this family of zinc finger proteins is the mouse 
transcription factor Zif 268 [Pavletich et al.. (1991) Sci..e 252(5007), 809-817; Elrod- 
Erickson et al., (1996) Structure 4(10). 1 171-1 180]. The analysis of the Zif 268/DNA 
complex suggested that DNA binding is predominantly achieved by the interaction of ammo 
acid residues of the a-helix in position -1, 3, and 6 with the 3', middle, and 5' nucleotide of a 
3 bp DNA subsite. respectively. Positions 1, 2 and 5 have been shown to make direct or 
water-mediated contacts with the phosphate backbone of the DNA Leucine is usually found 
in position 4 and packs into the hydrophobic core of the domain. Position 2 of the a-hehx 
has been shown to interact with other helix residues and, in addition, can make contact to a 
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nucleotide outside the 3 bp subsite [Pavletich et al., (1991) Science 252(5007). 809-817; 
Elrod-Erickson et al.. (1996) Structure 4(10). 1171-1180; Isalan. M. et al.. (1997) ProcNatl 
AcadSci USA 94(11), 5617-5621]. 

Zinc finger DNA binding domains can be assembled into zinc finger proteins 
recognizing extended 1 8 bp DNA sequences which are unique within the human or any other 
genome, hi addition, these proteins fimction as transcription factors and are capable of 
altering gene expression when fused to regulatory domains and can even be made hoimone- 
dependentby fiision to Ugand-binding domains of nuclear hormone receptors. To date, 
however, polypeptides containing one or more zinc finger binding domains target a single 
gene or contain a single transcription regulating domain. There is a need in the art, therefore, 
for transcription regulating polypeptides that can be used to target more than one gene or 
contain more than one transcription regulating domain. 

The present disclosure provides polypeptides that contain a plurality of DNA binding 
domains and one or more transcription regulating domains. Such polypeptides can be used to 
regulate transcription of more than one target gene or to enhance the activation or repression 
of single genes. 



Brief Summary of the hivention 

The present invention provides a non-naturaUy occurring artificial transcription factor 
polypeptide comprising a plurality of DNA binding domains (DNB) operatively linked to 
each other. The DNA bmding domains each bind independently to a same or different 
nucleotide sequence. The polypeptide can further contain one or more transcription 
regulating domains, each of which is operatively linked to one of the DNA binding domains. 

The different nucleotide sequences are located in a transcriptional control region of 
the same gene or different genes. Where the nucleotide sequences are located in 
transcriptional control regions of a single gene, such nucleotide sequences are separated from 
each other by at least 10 base pairs. The polypeptide contains two or more DNBs. hi one 
embodiment, the polypeptide contains two or three DNA binding domains. Each DNA 
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binding domain preferably contains from 3 to 6 zanc finger peptides and, more preferably 6 
zinc finger peptides. 

The DNA binding domains are preferably operatively Unked to each olher with an 
amino acid residue sequence of from 5 to 50 amino acid residues, preferably fi»m 5 to 40 
amino acid residues, more preferably &om 5 to 30 amino acid residues and, even more 
preferably fi»m 5 to 15 amino acid residues. 

In another aspect, the present invention provides a polynucleotide that encodes a 
polypeptide of this invention, an expression vector that contains such a polynucleotide and a 
cell transformed with such a polynucleotide or expression vector. 

In yet another aspect, the present invention provides processes for regulating gene 
transcription. In one embodiment, a present method is directed to simultaneously regulating 
transcription of a plurality of DNA target genes in a cell. Such a method comprises the steps 
of transforming the cell with a polynucleotide that encodes a polypeptide having a plurality of 
operatively Unked DNA binding domains, each of which DNA binding domams specifically 
bmds to a nucleotide sequence in a transcriptional control region of different DNA target 
genes and maintaining the cell under conditions and for a period of time sufficient for 
expression of the polypeptide. In a second embodiment, a method is directed to regulating 
transcription of a single gene. Such a method comprises the steps of transforming the cell 
with a polynucleotide that encodes a polypeptide having a plurahty of operatively linked 
DNA binding domains, each of which DNA binding domains specifically binds to a different 
nucleotide sequence in a transcriptional control region of the DNA target gene and 
maintainmg the cell under conditions and for a period of time sufficient for expression of the 
polypeptide. Preferably, a method of this invention uses a polypeptide that also contains one 
or more transcription regulating domains. 



Brief Description of the Drawings 

In the drawings that form a portion of the specification, 

FIG. 1 shows a schematic representation of a transcription factor polypeptide of this 
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invention. DNB represents a DNA binding domain. N is fiom 1 to 10. -Represents an 
amino acid residue linker. 

FIG. 2 shows exemplary arrangements of DNBs and repressor (SKD) or activation 
(VP64) transcription regulating domains assembled out of two DNBs connected by a flexible 
linker. 



Dfitailed Description of the Invention 
I. The Invention 

The present invention provides non-naturally occurring transcription factor 
polypeptides useful for regulating gene transcription, polynucleotides that encode such 
polypeptides and the use of such polypeptides and polynucleotides in regulating gene 
transcription. 

n. Polypeptides 

The present invention provides non-naturally occurring polypeptides that contain a 
plurality of DNA binding domains pNB). which binding domains are derived from zinc 
fingerDNAbindingpeptidesCSeeHG. 1). A polypeptide ofthis invention is non-naturally 
occurring. As used herein, the term "non-naturally occurring" means, for example, one or 
more of the following: (a) a polypeptide comprised of a non-naturally occurring amino acid 
sequence; (b) a polypeptide having a non-naturaUy occurring secondary structure not 
associated with the polypeptide as it occurs in nature; (c) a polypeptide that includes one or 
more amino acids not normally associated with the species of organism in which that 
polypeptide occurs innature; (d) a polypeptide that includes a stereoisomer of one or more of 
the amino acids comprising the polypeptide, which stereoisomer is not associated with the 
polypeptide as it occurs in nature; (e) a polypeptide that includes one or more chemical 
moieties other than one of the natural amino acids; or (f) an isolated portion of a naturally 
occurring amino acid sequence (e.g.. a truncated sequence). A polypeptide of this invention 
exists in an isolated form and purified to be substantially free of contaminating substances. A 
polypeptide can be synthetic in nature. That is. the polypeptide is isolated and purified from 
natural sources or made de novo using techniques weU known in the art. A polypeptide of 
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this invention can be made using a variety of standard techniques well known in the art. 

Amino acid residues of polyp^tides are expressed herein using the standard 1 or 3- 
letter codes (See Table 1, below). 
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TABLE 1 



5 



10 



15 



20 



Amino Acid 


3 Letter 
i^ode 


1 Letter 

^OuC 


Alanine 


Ala 


A 

A 


Cysteine 


uys 




Aspartic Acid 


Asp 


U 


Glutamic Acid 


CjIu 


T7 

iZr 


Phenylalanine 


Fne 


TJ 

r 


Glycine 


Gly 


KJ 


Histidine 


His 


XT 

ri 


Isoleucine 


T1 _ 

He 


1 


Lysine 


Lys 




Leucine 


Leu 


T 


Methionine 


Met 


JVl 


Asparagine 


A 

Asn 


XT 

JN 


Proline 


Pro 


T) 

F 


Glutamine 


Gin 


r\ 


Arginine 


A _ ~ 

Arg 


iv 


Serine 


Ser 


s 


Threonine 


Thr 


T 


Valine 


Val 


V 


Tryptophan 


Trp 


w 


Tyrosine 


Tyr 


Y 


Stop Codons 




Z 



In certain embodiments, a polypeptide variant comprises a conservatively substituted 
amino acid residue. It is preferred that each amino acid substitution is made by substituting 
the amino acid of interest with an amino acid from a group of similar amino acid(s) as listed 
in the Table 2, below. (See Biochemistry, 3rd Edition, Stryer, Freeman PubUsher (1988) 
pages 16-40, incorporated herein by reference). 
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Refeiring to the Table 2, for example, in certain embodiments, a G amino acid residue 
in a desired polypeptide is substituted with an A, V. L, or L In another example, an N residue 
in a desired polypeptide is substituted with a D, E, or Q. It is generally preferred that the first 
amino acid (or codon in the underlying polynucleotide) of an open reading firame is 
methionine. 

TABLE 2 

Preferred Amino Acid Grouping for Conservative Substitution 



Side Chain Characteristic 


Conservative Amino Acid Groups 


Aliphatic 


G,A,V,L,I 


Aliphatic with secondary amino group 


P 


Aromatic 


F,Y,W 


Siilftir containing 


C,M 


Aliphatic hydroxyl 


S, T 


Basic 


K, R, H 


Acidic 


D,E,N,Q 



A DNA binding domain of an instant polypeptide is derived or isolated firom zinc 
finger DNA binding peptides, which peptides are well known in the art. Preferably, the zinc 
finger DNA bindingpeptide is derivedfromaCys,-His,typezinc finger. A zinc finger DNA 
binding peptide derivative can be derived or produced fi-om a wild type zinc finger protein by 
truncation or expansion, or as a variant of a wild type-derived peptide by a process of site 
directed mutagenesis, or by a combination of the procedures (See^. United States Patent 
Numbers 6.242,568; 6,140.466; and 6.140,081, the disclosures of which are incorporated 
herein by reference). The term "tnmcated" refers to a zinc finger-nucleotide binding 
polypeptide that contains less that the fiiU number of zinc fingers found in the native zinc 
finger binding protein or that has been deleted of non-desired sequences. For example, 
truncation of the zinc finger-nucleotide bmding protem TFHIA, which naturally contains nine 
zinc fingers, might be a polypeptide with only zinc fingers one through three. Expansion 
refers to a zinc finger polypeptide to which additional zinc finger modules have been added. 
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For example, TFIIIA may be extended to 12 fingers by adding 3 zinc finger domains. 

In addition, a truncated zinc finger-nucleotide binding polypeptide may include zinc 
finger modules firom more than one ^^dld type polypeptide, thus resulting in a "hybrid" zinc 
finger-nucleotide binding polypeptide. The term «'mutagenized" refers to a zinc finger 
derived-nucleotide binding polypeptide that has been obtained by performing any of the 
known methods for accompUshing random or site-directed mutagenesis of the DNA encoding 
the protein. For instance, in TFIIIA. mutagenesis can be performed to replace nonconserved 
residues in one or more ofthe repeats ofthe consensus sequence. Truncated zinc finger- 
nucleotide binding proteins can also be mutagenized. Examples of known zinc finger- 
nucleotide binding polypeptides that can be truncated, expanded, and/or mutagenized 
according to tiie present invention in order to inhibit the fimction of a nucleotide sequence 
containing a zinc finger-nucleotide binding motif includes TFIIIA and ziG68. Other zinc 
finger-nucleotide binding proteins will be known to those of skill in the art. 

A polypeptide of this invention comprises a plurality of DNA binding domains. 
Preferably, the polypeptide contains from 2 to 10 such domains, more preferably from 2 to 5 
such domains and. most preferably, 2 or 3 such domains. The DNA binding domains are 
operatively linked to each other. By "operatively linked" is meant tiiat the structure and 
function of each DNA binding domain is unaffected by tixe linking of any otiier such domain, 
m one embodiment, the DNA binding domains are directiy linked or bonded together via well 
known peptide linkages. In another embodiment, the DNA binding domains are operatively 
linked using a peptide linker containing from 5 to 50 amino acid residues. Preferably, the 
linker contains from 5 to 40 amino acid residues, more preferably from 5 to 30 amino acid 
residues and. even more preferably from 5 to 15 amino acid residues. The linkers are 
preferably flexible. Exemplary such hnkers are set forth below. 



Linker 1 : TGEKP (SEQ ID NO: I) 



PGGGGSGGGGTGSSRSSSTGEKP (SEQIDN0:2) 
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Linker 3: PGSSGGGGSGGGGGGSTGGGSGGGGTGSSRSSSTGEKP (SEQ ID 
N0:3) 

Linker 4: TGGGGSGGGGTGEKP (SEQ ID N0:4) 

Where a transcription factor polypeptide of this invention contains 2 DNA binding 
domains, a single linker operatively links those domains. Where more than two DNA binding 
domains are present, a linker is used to operatively Unk each binding domain. In such an 
embodiment, the same or different linker can be employed at each linking location. 

DNA binding domains used in the present transcription factors can be naturally- 
occurring or non-naturally occurring. Naturally-occurring zinc finger DNA binding domains 
are well known in the art. In a preferred embodiment, at least one DNA binding domain of a 
present transcription factor is non-naturally occurring. Each of the DNA binding domains is 
preferably designed and made to specifically bind nucleotide target sequences corresponding 
to the fonnula 5'-NNN-3', where N is any nucleotide (i.e., A. C. G or T). Such DNA binding 
domains are well known in the art ( See, e. g., U.S. Patent Nos. 6.242.568, 6.140.466 and 
6.140.081. the disclosures of which are incorporated herein by reference). A zinc finger DNA 
binding peptide of this invention comprises a unique heptamer (contiguous sequence of 7 
amino acid residues) withm the a-heUcal domain of the peptide, which heptameric sequence 
determines binding specificity to a target nucleotide. That heptameric sequence can be 
located anywhere within the a-heUcal domain but it is preferred that the heptamer extend 
from position -1 to position 6 as the residues are conventionally numbered in the art. A 
peptide can include any p-sheet and framework sequences known in the art to fimction as part 
of a zinc finger peptide. 

Previously we reported the characterization of 16 zinc finger domains specifically 
recognizing each of the 5'-GNN-3' type of DNA sequences, that were isolated by phage 
display selections based on C7, a variant of the mouse transcription factor Zif268 and refined 
by site-directed mutagenesis [United States Patent Number 6.140.081, the disclosure of which 
orporated herein by reference]. Briefly, phage display Ubraries of zinc finger proteins 
created and selected under conditions that favored enrichment of sequence specific 



is ma 
were 
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proteins. Zinc finger domains recognizing a number of sequences required refinement by 
site-directed mutagenesis that was guided by both phage selection data and structural 
information. A simUar system has been employed to identify domains that recognize the 5'- 
TNN-3' type of DNA sequences. 

To extend the avaUabiUty of zinc finger domains for the construction of artificial 
transcription factors, domains specificaUy recognizing the 5'-ANN-3' and 5'-CNN-3' types 
of DNA sequences were selected. Briefiy, the helix TSG-N-LVR (SEQ ID NO: 5), 
previously characterized in finger 2 position to bind with high specificity to the triplet 5'- 
GAT-3', containing finger 1 and 2 of C7 and the 5 '-GAT-3 '-recognition helix in finger-3 
position, was analyzed for DNA-binding specificity on targets with different finger-2 subsites 
by multi-target ELISA in comparison with the original C7 protein [See, e.g., Dreier, B. et al.: 
y5io/ CAe/M (2001) Aug 3;276(31):29466-78]. 

A polypeptide of this invention can fiirther comprise one or more transcription 
regulating domains. A transcription regulating domain can be an activation domain or a 
repression domain, as is well known in the art. An exemplary repression domain peptide is 
the ERF repressor domain (ERD), (Sgouras, D. N., Athanasiou, M. A., Beal, G. J., Jr., Fisher, 
R. J., Blair, D. G. & Maviothalassitis, G. J. (1995) EMBO J. 14, 4781-4793), defined by 
amino acids 473 to 530 of the ets2 repressor factor (ERF). This domain mediates the 
antagonistic effect of ERF on the activity of transcription factors of the ets family. A second 
repressor protein is prepared using the gnippel-associated box (KRAB) domain (MargoUn, J. 
F., Friedman, J. R.. Meyer, W., K.-H., Vissing, H., Thiesen, H.-J. & Rauscher m, F. J. (1994) 
Proc. Natl. Acad. Sci. USA 91, 4509-4513). This repressor domain is commonly found at the 
N-tenninus of zinc finger proteins and presumably exerts its repressive activity on TATA- 
dependent transcription in a distance- and orientation-independent manner (Pengue, G. & 
Lania. L. (1996) Proc. Natl. Acad. Sci. USA 93, 1015-1020), by interacting with the RING 
finger protein KAP-1 (Friedman, J. R., Fredericks, W. J., Jensen, D. E.. Speicher, D. W., 
Huang, X.-P., Neilson, E. G. & Rauscher IE, F. J. (1996) Genes & Dev. 10. 2067-2078). We 
utilized the KRAB domain found between amino acids 1 and 97 of the zinc finger protein 
KOXl (Margolin, J. F., Friedman. J. R., Meyer, W., K.-H., Vissing, H., Thiesen, H.-J. & 
Rauscher ffl, F. J. (1994) Proc. Natl. Acad Sci. USA 91, 4509-4513). In this case anN- 



wo 03/104414 PCT/US03/17946 

-11- 

tenninal fusion with a zinc-finger polypeptide is constructed. Finally, to explore the utility of 
histone deacetylation for repression, amino acids 1 to 36 of the Mad mSIN3 interaction 
domain (SID) are fused to the N-terminus of the zinc finger protein (Ayer. D. E., Laherty. C. 
D.. Lawrence. Q. A.. Armstrong, A P. & Eisenman, R. N. (1996) Mol Cell. Biol. 16. 5772- 
5781). This small domain is found at the N-terminus of the transcription factor Mad and is 
responsible for mediating its transcriptional repression by interacting with mSIN3, which in 
turn interacts the co-repressor N-CoR and with the histone deacetylase mRPDl (Heinzel. T.. 
Lavinsky, R. M., Mullen, T.-M., SSderslrsm, M.. Laherty. C. D.. Torchia. J., Yang, W.-M.. 
Brard, G.. Ngo, S. D. & al., e. (1997) Nature 387. 43-46). To examine gene-specific 
activation, transcriptional activators are generated by fiising the zinc finger polypeptide to 
amino acids 413 to 489 of the herpes simplex virus VP16 protein (Sadowski. L. Ma, J., 
Triezenberg. S. & Ptashne, M. (1988) Nature 335. 563-564), or to an artificial tetrameric 
repeat of VP16's minimal activation domain, (Seipel, K., Georgiev. O. & SchafiBier, W. 
(1992)£M50J. 11, 4961-4968). termed VP64. The transcription regulating domains can be 
operatives linked to aDNA binding domain at either theN- or C-terminus of the binding 
domain. 

A transcription regulating domain, when present, can be situated at either the N- or C- 
terminal of a present polypeptide or adj acent to and between a DNA bindmg domain and a 
Unker (see FIG. 2). A polypeptide of this mvention can contain one or more transcription 
regulating domains. Where a plurality of transcription regulating domains are present, each 
domain can be the same or different. Similarly, a single polypeptide can contain both 
repressor and activation domains. FIG. 2 shows an exemplary polypeptides of this invention 
having two DNA binding domains and either a single repressor or single activation domain or 
a combination of such repressor and activation domains. 

m. Pn1vnucleotide.s and Ex pression Vectors 

The invention includes a nucleotide sequence encoding a zinc finger-nucleotide 
binding polypeptide. DNA sequences encoding the zinc finger-nucleotide binding 
polypeptides of the invention, includmg native, truncated, and expanded polypeptides, can be 
obtained by several methods. For example, the DNA can be isolated usmg hybridization 
procedures which are well known in the art. These include, but are not limited to: 
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(1) hybridization of probes to genomic or cDNA libraries to detect shared nucleotide 
sequences; (2) antibody screening of expression Ubraries to detect shared structural features; 
and (3) synthesis by the polymerase chain reaction (PGR). SNA sequences of the invention 
can be obtained by methods known in the art (See. for example, Oirrant Protocols in 
Mnleciilar Biology, Ausubel, et al., Eds., 1989). 

The development of specific DNA sequences encoding zinc fmger-nucleotide binding 
polypeptides of the invention can be obtained by: (1) isolation of a double-stranded DNA 
sequence from the genomic DNA; (2) chemical manufacture of a DNA sequence to provide 
the necessary codons for the polypeptide of interest; and (3) in vitro synthesis of a double- 
stranded DNA sequence by reverse transcription of mRNA isolated from a eukaiyotic donor 
cell, hi the latter case, a double-stranded DNA complement of mimA is eventually fomied 
which is generally referred to as cDNA. Of these three methods for developing specific DNA 
sequences for use in recombinant procedures, the isolation of genomic DNA is the least 
common. This is especially true when it is desirable to obtain the microbial expression of 
mammahan polypeptides due to the presence of introns. 

For obtaining zinc finger derived-DNA binding polypeptides, the synthesis of DNA 
sequences is frequently the method of choice when the entire sequence of amino acid residues 
of the desired polypeptide product is known. When the entire sequence of amino acid 
residues of the desired polypeptide is not known, the direct synthesis of DNA sequences is 
not possible and the method of choice is the formation of cDNA sequences. Among the 
standard procedures for isolating cDNA sequences of interest is the formation of plasmid- 
carrying cDNA libraries which are derived from reverse transcription of mRNA which is 
abundant in donor cells that have a high level of genetic expression. When used in 
combination with polymerase cham reaction technology, even rare expression products can be 
clones, hi those cases where significant portions of the amino acid sequence of the 
polypeptide are known, the production of labeled single or double-stranded DNA or KNA 
probe sequences dupUcating a sequence putatively present in the target cDNA may be 
employed in DNA/DNA hybridization procedures which are carried out on cloned copies of 
the cDNA which have been denatured into a smgle-stranded form (J ay. et al.. Nucleic Acid 
Research 11:2325, 1983). 
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IV. Pharmaceutical Comp ositions 

In another aspect, the present invention provides a pharmaceutical composition 
comprising atherapeutically effective amount of a polypeptide of this invention or a 
therapeutically effective amount of a nucleotide sequence that encodes such apolypeptide m 
combination with a phaimaceutically acceptable earner. 

As used herein, the terms "pharmaceutically acceptable", '•physiologically tolerable" 

and grammatical variations thereof, as they refer to compositions, carriers, diluents and 

reagents, are used interchangeably and represent that the materials are capable of 
administration to or uponahuman without theproduction of undesirablephysiologicdeff^^^ 

such as nausea, dizzmess. gastric upset and the like which would be toadegreethat would 
prohibit administration of the composition. 

m preparation of a pharmacological composition that contains active ingredients 

dissolved or dispersed Iherein is well understood in the art. Typically such compositions are 

prepared as sterile injectables either as liquid solutions or suspensions, aqueous or non- 
aqueous, however, soUd fonns suitable for solution, or suspensions, in liquid prior to use can 

also be prepared. The preparation can also be emulsified. 

m active ingredient can be mixed with excipients which are pharmaceutically 
acceptable and compatible with ti.e active mgredient and in amounts suitable for use m Ihe 
ti^erapeutic methods described herem. Suitable excipients ^e. for example, water, sahne, 
dextrose, glycerol, ethanolor the like and combinations thereof, in addition, if desired, t^^ 

. compositioncancontainminoramountsofauxiUarysubstancessuchaswettingor 
emulsifying agents, as well as pH buffering agents and the like which enhance the 
effectiveness of the active ingredient. 

The therapeutic pharmaceutical composition of the present invention can include 
pham.aceuticaUyacceptablesaltsofti.ecomponentsti.erein. Pharmaceutically acceptable 
salts include the acid addition salts (formed with ti.e free amino groups of ti.e polypeptide) 
that are formed witir inorganic acids such as. for example, hydrochloric or phosphoric acrds. 
or such organic acids as acetic, tartaric, mandelic and the hke. Salts formed with the free 
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caAoxyl groups can also be derived from inorganic bases such as, for example, sodium, 
potassium, ammonium, calcium or ferric hydroxides, and such organic bases as 
isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine. procaine and the like. 

Physiologically tolerable carriers are well known in the art. Exemplary of Uquid 
carriers are sterile aqueous solutions that contain no materials in addition to the active 
ingredients and water, or contain a buffer such as sodium phosphate at physiological pH 
value, physiological saline or both, such as phosphate-buffered saline. Still further, aqueous 
carriers can contain more than one buffer salt, as weU as salts such as sodium and potassium 
chlorides, dextrose, propylene glycol, polyethylene glycol and other solutes. Liquid 
compositions can also contain Uquid phases in addition to and to the exclusion of water. 
Exemplary of such additional Uquid phases are glycerin, vegetable oils such as cottonseed oil. 
organic esters such as ethyl oleate, and water-oil emulsions. 

V. • IJs^ 

In one embodiment, a method of the invention includes a process for modulating 
(inhibiting or suppressing) expression of a nucleotide sequence comprising a binding motif, 
which method includes the step of contacting the binding motif with an effective amount of a 
subject polypeptide that binds to the motif. The binding motif is preferably located in a 
transcriptional control region of the target gene. A transcriptional control region is any region 
of a gene involved in regulating transcription. An exemplary such region is a promoter, hi 
the case where the nucleotide sequence rs a promoter, the method includes inhibitmg the 
transcriptional transactivationofagenecontainingazincfinger-DNAbindingmotif. The 

term "inhibiting" refers to the suppression of the level of activation of transcription of a 
structural gene containing a zinc finger-nucleotide binding motif, for example. In addition, 
the gene transcription regulating polypeptide may bind a motif within a structural gene or 
within an RNA sequence. 

The term "effective amount" includes that amount which results in the deactivation of 
a previously activated promoter or that amount which results in the inactivation of a promoter 
containing a zinc finger-nucleotide binding motif, or that amount which blocks transcription 
of a structural gene or translation of RNA. The amount of gene transcription regulating 
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polypeptide required is that amount necessary to either displace a native zinc finger- 
nucleotide binding protein in an existing protein/promoter complex, or that amount necessary 
to compete with the native zinc finger-nucleotide binding protein to fonn a complex with the 
promoter itself. SimUarly, the amount required to block a structural gene or RNA is that 
amount which binds to and blocks RNA polymerase from reading through on the gene or that 
amount which inhibits translation, respectively. Preferably, the method is perfomied 
intracellularly. By functionally inactivating a promoter or structural gene, transcription or 
translation is suppressed. DeUvery of an effective amount of the inhibitory protein for 
binding to or "contacting" the cellular nucleotide sequence containing the zinc finger- 
nucleotide binding protein motif, can be accomplished by one of the mechanisms described 
herein, such as by retroviral vectors or Uposomes, or other methods well known in the art 

The term "modulating" refers to the suppression, enhancement or induction of a 
function. For example, the gene transcription regulating polypeptide of the invention may 
modulate a promoter sequence by binding to a motifwithin the promoter, thereby enhancmg 
or suppressing transcription of a gene operatively Unked to the promoter nucleotide sequence. 
Alternatively, modulation may include inhibition of transcription of a gene where the gene 
transcription regulating polypeptide binds to the structural gene and blocks DNA dependent 
RNA polymerase from reading through the gene, thus inhibiting transcription of the gene. 
The structural gene maybe a normal cellular gene or an oncogene, for example. 
Alternatively, modulation may include inhibition of translation of atranscript. 

The promoter region of a gene includes the regulatory elements that typically Ue 5' to 
a structural gene. If a gene is to be activated, proteins known as transcription factors attach to 
the promoter region of the gene. Tliis assembly resembles an "on switch" by enabhng an 
enzyme to transcribe a second genetic segment from DNA to RNA. In most cases the 
resulting RNA molecule serves as atemplate for synthesis of a specific protein; sometimes 
RNA itself is the final product. 

The promoter region may be a normal cellular promoter or, for example, an onco- 
promoter. An onco-promoter is generally a virus-derived promoter. For example, the long 
terminal repeat (LTR) of retroviruses is a promoter region which may be a target for a zmc 
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finger binding polypeptide variant of the invention. Promoters from members of the 
Lentivirus group, which include such pathogens as human T-cell lymphotrophic virus 
(HTLV) 1 and 2, or human immunodeficiency virus (HIV) 1 or 2, are examples of viral 
promoter regions which may be targeted for transcriptional modulation by a polypeptide of 
the invention. 

The Examples that follow show the use of polypeptides of this invention to alter 
expression of polynucleotides encoding particular gene products. The Examples are 
representative of particular embodiments of this mvention and are not limiting of the 
specification and/or claims in any way. 

EXAMPLE !: TranscriotioT i Factor Polypeptides 

A series of ti-anscription factor polypeptides that target specific genes were made and 
used to alter expression of gene products. E2c and E2x are six finger proteins that bind in the 
post-transcriptional and pre-translatorial region of the erbB2 gene, as fiision proteins with the 
effector domains vp64 and SKD they regulate erbB2 expression in both directions [BeerU, R. 
et al: PNAS (1998), 95, 14628-14633; and Dreier, B. et al: J Biol Chem (2001) Aug 
3;276(31):29466-78]. E3 and E3Y are six finger proteins that bind in the post transcriptional 
a^d pre-translatorial region of the erbB3 gene, as fiision proteins with the effector domains 
vp64 and SKD they regulate erbB3 expression in both directions [BeerU, R. et al: 
PNAS(1998), 95, 14628-14633; and Dreier, B. et al: J Biol Chem (2001) Aug 
3;276(31):29466-78]. Exemplary polypeptides are shown below togetiier with the DNA 
target sequence for that polypeptide. The six finger proteins were generated as described 
elsewhere (Segal. D et al: PNAS (1999), 96,2758-2763). 

E2cJ15E3Y 

MAQAALEPGEKPYACPECGKSFSRKDSLVRHQRTHTGEKPYKCPECGKSFSQSGDIA 

RHQRTHTGEKPYKCPECGKSFSDCRDLARHQRTHTGEKPYACPECGKSFSQSSHLVR 

HQRTHTGEKPYKCPECGKSFSDCRDLARHQRTHTGEKPYKCPECGKSFSRSDKLVRH 

QRTHTGGGGSGGGGTGEKPYACPECGKSFSDKKDLTOHQRTHTGEKPYKCPECGKS 

FSDCRDIJ^QRTHTGEKPYKCPECGKSFSQLAHLRAHQRIHTGEKPYACPECGKSF 

SQSGDLRRHQRTHTGEKPYKCPECGKSFSRSDNLVRHQRTHTGEKPYKCPECGKSFS 

DPGALRVHQRTHTGKKTSGQAG (SEQ ID N0:6) 
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DNA TARGET SEQUENCE:E2c: GGG GCC GGA GCC GCA GTG (SEQ ID NO:7); 
E3 Y: ATC GAG GCA AGA GCC ACC (SEQ ID NO:8) 

E2xJ15E3 

MAQAALEPGEKPYACPECGKSFSQSSHLVRHQRTHTGEKPYKCPECGKSFSRSDHLA 
EHQRTHTGEKPYKCPECGKSFSDKKDLTRHQRTHTGEKPYACPECGKSFSQSSNLVR 
HQRTHTGEKPYKCPECGKSFSQSSHLVRHQRTHTGEKPYKCPECGKSFSDKKDLTRH 
QRTHTHTGGGGSGGGGTGEKPYACPECGKSFSDPGALVRHQRTHTGEKPYKCPECG 
KSFSQSSHLVRHQRTHTGEKPYKCPECGKSFSDCRDLARHQRTHTGEKPYACPECGK 
SFSQSSHLVRHQRTHTGEKPYKCPECGKSFSDCRDLARHQRTHTGEKPYKCPECGKS 
FSQSSHLVRHQRTHTGKKTSGQAG (SEQ ID NO:9) 

DNA TARGET SEQUENCE:E3: GGA GCC GGA GCC GGA GTC (SEQ ID NO:10); 
E2X: ACC GGA GAA ACC AGG GGA (SEQ ID NO:ll) 

E3J15E2X 

MAQAALEPGEKPYACPECGKSFSDPGALVRHQRTHTGEKPYKCPECGKSFSQSSHLV 

lOIQRTHTGEKPYKCPECGKSFSDCRDLARHQRTHTGEKPYACPECiGKSFSQSSHLVR 

HQRTHTGEKPYKCPECGKSFSDCRDLARHQRTHTGEKPYKCPECGKSFSQSSHLVRH 

QRTHTGGGGSGGGGTGEKPYACPECGKSFSQSSHLVRHQRTHTGEKPYKCPECGKS 

FSRSDHLAEHQRTHTGEKPYXCPECGKSFSDKKDLTRHQRTHTGEKPYACPECGKSF 

SQSSNLVRHQRTHTGEKPYKCPECGKSFSQSSHLVRHQRTHTGEKPYKCPECGKSFS 

DKKDLTRHQRTHTGKKTSGQAG (SEQ ID NO:12) 

DNA TARGET SEQUENCE:E3: GGA GCC GGA GCC GGA GTC (SEQ ID NO: 10); 
E2X: ACC GGA GAA ACC AGG GGA (SEQ ID N0:13) 

E2cJlSE3 

MAQAALEPGEKPYACPECGKSFSRKDSLVRHQRTHTGEKPYKCPECGKSFSQSGDLR 

RHQRTHTGEKPYKCPECGKSFSDCRDIJ\RHQRTHTGEKPYACPECGKSFSQSSHLVR 

HQRTHTGEKPYKCPECGKSFSDCRDIARHQRTHTGEKPYKCPECGKSFSRSDKLVRH 

QRTHTGGGGSGGGGTGEKPYACPECGKSFSDPGALVRHQRTHTGEKPYKCPECGKS 

FSQSSHLVRHQRTHTGEKPYKCPECGKSFSDCRDLARHQRTHTGEKPYACPECGK5F 
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SQSSHLVRHQRIOTGEKPYKCTECGKSFSDCRDIJ^QRTHTGEKPYKCPECGKSFS 
QSSHLVRHQRTHTGKKTSGQAG (SEQ ID NO: 14) 

DNA TARGET SEQUENCE:E2c: GGG GCC GGA GCC GCA GTG (SEQ ID NO:7); 
E3: GGAGCCGGAGCCGGAGTC(SEQIDNO:10) 

E2xJ15E3y 

MAQAALEPGEKPYACPECGKSFSQSSHLVRHQRTHTGEKPYKCPECGKSFSRSDHLA 

EHQRTHTGEKPYKCPECGKSFSDKKDLTEIHQRTHTGEKPYACPECGKSFSQSSNLVR 

HQRTHTGEKPYKCPECGKSFSQSSHLVRHQRTHTGEKPYKCPECGKSFSDKKDLTRH 

QRTHTHTGGGGSGGGGTGEKPYACPECGKSFSDKKDLTRHQRTHTGEKPYKCPECG 

KSFSDCRDLARHQRTHTGEKPYKCJPECGKSFSQLAHLRAHQRTHTGEKPYACPECG 

KSFSQSGDIJiRHQRTHTGEKPYKCPECGKSFSRSDNLVRHQRTHTGEKPYKCPECGK 

SFSDPGALRVHQRTHTGKKTSGQAG (SEQ ID N0:15) 

DNA TARGET SEQUENCE:E2X: ACC GGA GAA ACC AGG GGA (SEQ ID NO:13); 
E3Y: ATC GAG GCA AGA GCC ACC (SEQ ID N0:8) 

Various combinations of E2c, E2x, E3 and E3y (linked with short or long linkers) 
were attached to either a repressor domain (SKD) or activation domain (VP64). The charts 
below siramiarize such polypeptide. 

Short linker constructs: Jl 5 
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Long linker constructs: J30 
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20 Instead of the regular canonical linker between Zif domains (TGEKP) the twelve fingers 

contain longer linkers to connect the six finger proteins. They were introduced by PGR using the 
forward primers J15F or J30F and pMalseq Back as reverse primer. The PGR template can be 
a regular three or six finger in pMal (See Scheme 1 , below): 

25 Sequence Primer short linker J15: 
Sequence Range: 1 to 90 
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30 >Xmal| 

I I 

1 I 10 20 30 

TGA GCC CGG GGG CGG TGG CTC GGG COG TGG 
ACT CGG GCC CCC GCC ACC GAG CCC GCC ACC 

35 

>BsrFI >BglII 

I I 
>AgeI >XbaI | 

I I 1 

40 1 40 I I so 60 

TGG GAC CGG TTC CTC TAG ATC TTC CTC CAC 
ACC CTG GCC AAG GAG ATC TAG AAG GAG GTG 



45 



70 80 90 

CGG GGA GAA GCC CTA TGC TTG TCC GGA ATG (SEQ ID NO: 16) 
GCC CCT CTT CGG GAT ACG AAC AGG CCT TAC (SEQ ID NO: 17) 
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Sequence Primer Long Linker J30F: 
Sequence Range: 1 to 99 

>XmaI 

I 

I 10 20 30 

CCC GGG TCC TCT GGT GGC GGT GGC TCG GGC 
GGG CCC AGG AGA CCA CCG CCA CCG AGC CCG 

40 50 60 

GGT GGT GGG GGT GGT TCC ACT GGC GGT GGC 
CCA CCA CCC CCA CCA AGG TGA CCG CCA CCG 

>AgeI 
I 

70 I BO 90 

TCG GGC GGT GGT GGG ACC GGT TCC TCT AGA 
AGC CCG CCA CCA CCC TGG CCA AGG AGA TCT 

TCT TCC TCC <SEQ ID NO: 18) 
AGA AGG AGG (SEQ ID NO: 19) 
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Scheme 1 : Cloning relevant restriction sites of a six finger protein in pMal 



Six Finger Restriction Sites 
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The PGR Product was cleaved with Xmal and Spel and any original Zif protein 
cleaved with Agel and SpeL The long Unker containing zinc finger was then inserted 
between Agel and Spel. Xmal/Agel form compatible cohesive ends and both restriction sites 
(Xmal/Agel) disappear during that cloning step. A new finger can be inserted by cutting this 
construct AgeFSpel and the original finger Xmal/Spel. Thus the resulting twelve finger has 
the same restriction sites as a six finger (Scheme 2). Consequently the assembly can be 
extended to n fingers, and can also combine three with six fingers etc. 
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Scheme 2: Cloning relevant restriction sites of a twelve finger protein 
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EXAMPLE 2: Rinding to Tarp ^* r)NA and Trnnsrription Regulation 

Polypeptides ftom Example 1 were tested for their binding to specific DNA target 
sequences and for their abiUty to alter transcription. The results are summarized below. 

The 12 finger fusion proteins between e2c/e2x and e3/E3y are able to regulate both 
genes at once. This hypothesis was tested by transfecting the different pMXSKD-12 finger 
and pMX12finger vp64 constructs in 293-Gag-Pol ceUs and infecting A431 cells with the 
resulting virus. Three days after infection the cells were harvested and analyzed for erbB2 
and erbB3 expression levels by FACS. This procedure was done as described previously 
(Segal, D et al: PNAS(1999), 96,2758-2763). ELISA data of raw extracts of e2cJ15/30E3 
and e2cJl 5/30e3y show that aU four constructs bind their respective targets. 

Seven different constructs have been analyzed for regulatory effects on erbB2 and 
erbB3. Down regulation of both erbB2 and erbB3 to basal levels was observed for 
pMXSKDE2cJ15E3. Most efficient up regulation of both erbB2 and erbB3 was observed 
with the construct pMX E2cJ15E3vp64. pMX E2xJ15E3vp64 also affects both genes, as well 
as e2cE3yvp64. 

pMXe2cJ15e3 repressed erbB2 and erbB3 down to basal levels. The other three 
constructs work worse, they just affect erbB3 expression without repressing eibB2. This is 
true for both, e2x. which has lower affinity to its target (Km=l 5nM), and the high affinity 
(Km=0.5nM) e2c containing constructs. As e2x is once on the N-terminus, and once on the 
C-terminus, different positioning within the twelve finger does not improve erbB2 repression. 
This is not a question of zinc finger protein expression levels, as estimated by GFP 
expression. pMXe2cJ15e3 is one of the weaker expressors and the most effective repressor. 
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Also for the activators, pMXe2cJ15e3vp64 showed the best effect by clearly 
activating erbB2 and erbB3, In contrast to the repressors, however, the two other constructs 
also activated both genes. ErbB3 seems to be activated a bit stronger compared to erbB2. 

Double targeting within one promoter could increase the overall weak activation 
effect of zinc fingers. For that purpose pcDNAe2cJ15e2xvp64 and pcDNASKDe2cJ15e2x 
were transientiy transfected in Hela cells, together with the Luciferase reporter construct E2p. 
36 fold repression was observed for SKDe2c compared to 8 fold repression for 
SKDe2cJ15e2x. For activation, 45 fold activation was observed for the twelve finger 
compared to 78 fold activation by vp64e2c. 



m 
was 



The twelve finger construct pMXe2cJ15CD144#5 does activate erbB2 but not CD144 
A431 cells. Two independent clones were tested and showed ttie same effect. One clone 
fiilly sequenced and just one aa of the last helix was unreadable or ambiguous. 



Overview of repressor domains tested with erbB2 



Repression Domain 



SKD (entire KRAB-domain) 



Repression of erbB2 (as measured in 
transient reporter assay, Beerli et al.(1998) 
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EXAMPLES: Heneral procedures 

Construction of Zinc Finger Library and Selection via Phage Displu)^ 

Construction of the zinc finger library was based on the C-7 protein (United States 
Patent Number 6,140,081). Finger 3 recognizing the 5»-GCG-3» subsite was replaced by a 
domain binding to a 5»-GAT-3' subsite via a PGR overlap strategy using a primer coding for 
finger 3 (S^-GAG-GAAGTTTGCCACCAGTGGCAACCTGGTGAGGCATACCAAAATC- 

3')(SEQ ID NO:20) and a vector-specific primer (5'- 

GTAAAACGACGGCCAGTGCCAAGC-30(SEQ ID NO:21). Randomization of the zinc 
finger Ubrary by PGR overlap extension was essentially as is well known in the art. The 
Ubrary was ligated into the phagemid vector pGomb3H. Growth and precipitation of phage 
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were perfonned using standard techniques. Binding reactions were performed in a volume of 
500 ml of zinc buffer A (ZBA: 10 mM Tris, pH 7.5, 90 mM KCl, 1 mM MgCl^, 90mM 
ZnC12 ). 0.2% bovine serum albumin, 5 mM dithiothreitol, 1% Blotto (Bio-Rad), 20 mg of 
double-stianded, sheared herring sperm DNA containing 100 ml of precipitated phage (10" 
colony-forming units). Phage were allowed to bind to non-biotinylated competitor 
oUgonucleotides for 1 h at 4-C before the biotinylated target oUgonucleotide was added. 
Binding continued overnight at 4-C. After incubation with 50 ml of streptavidin-coated 
magnetic beads (Dynal; blocked with 5% Blotto in ZBA) for 1 h. beads were washed 10 times 
with 500 ml of ZBA, 2% Tween 20, 5 mM dithiothreitol, and once with buffer containing no 
Tween. Elution of bound phage was performed by incubation in 25 ml of trypsin (10 mg/ml) 
in Tris-buffered saline for 30 min at room temperature. 

Hairpin competitor oligonucleotides had the sequence 5'-GGCCGCN'N'N'AT 
CGAGTTTTCTCGATNNNGCGGCC-S' (SEQ ID NO:22), where NNN represents the 
finger-2 subsite oUgonucleotides and N'N'N' its complementary bases. Target 
oUgonucleotides were biotinylated and usually added at 72 nM in the first three rounds of 
selection and then decreased to 36 and 1 8 nM in the sixth and last round. As competitor a 5'- 
TGG-3' finger-2 subsite oUgonucleotide was used to compete with the parental clone. An 
equimolar mixture of 15 finger-2 5'-ANN-3' subsites, except for the target site and competitor 
mixtures of each finger-2 subsites of the type 5'-CNN-3', 5'-GNN-3', and 5'-TNN-3' were 
added in increasing amounts with each successive round of selection. Usually no specific 5'- 
ANN-3' competitor mix was added in the first round. 

Multitarget Specificity Assay and Gel Mobility Shift Analysis-Ths zinc finger-coding 
sequence was subcloned fi^m pCombSH into a modified bacterial expression vector pMal-c2 
(New England Biolabs). After transformation into XLl-Blue (Stratagene) the zinc finger- 
maltose-binding protem (MBP) fiisions were expressed by addition of 1 nM isopropyl b-D- 
thiogalactoside (IPTG). Freeze/thaw extracts of these bacterial cultures were appUed in 1 :2 
serial dilutions to 96-well plates coated with streptavidin (Pierce) and were tested for DNA 
binding specificity against each of the 16 5'-GAT ANN GCG-3' target sites. Enzyme-Unked 
immunosorbent assay (ELISA) was performed. After incubation with a mouse anti-MBP 
antibody (Sigma, 1 :1000), a goat anti-mouse antibody coupled with alkaline phosphatase 
(Sigma, 1:1000) was appUed. Detection occurred by addition of alkaline phosphatase 
substrate (Sigma), and the A405 was determined by a microtiter plate reader with 
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SOFTMAX2.35 (Molecular Devices). Gel shift analysis was perfomed with purified protein 
(Protein Fusion and Purification System, New England Biolabs). 

Site-directed Mutagenesis of Finger 2— Finger-2 mutants were constructed by PCR. As PCR 
template the pMal vector encoding for C7.GAT was used. PCR products containing a 
mutagenized finger 2 and 5'-GAT-3' finger 3 were subcloned via iVnl and ^Ipel restriction 
sites in frame with finger 1 of C7 (5'-GCG-3') into a modified pMal-c2 vector (New England 
Biolabs). 

Construction ofPolydactyl Zinc Finger /Votei«s-Three-finger proteins were constructed by 
finger-2 stitchery using the SPl C framework. The proteins generated in this work contained 
helices recognizing 5'-GNN-3' DNA sequences, as well as 5'-ANN-3' and 
5'-TAG-3' helices. Six finger proteins were assembled via compatible ^al and 5otFI 
restriction sites. Analysis of DNA-binding properties were performed using freeze/thaw 
extracts from IPTG-induced bacteria. For the analysis of the capability of these proteins to 
regulate gene expression, they were fiised to the activation domain VP64 or repression 
domain KRAB of Kox-1; VP64 (tetrameric repeat of the herpes simplex virus VP16 minimal 
activation domain) and subcloned into pcDNA3 (Invitrogen) or the retroviral pMX-IRES- 
GFP vector) internal ribosome-entry site (IRES) and green fluorescent protein (GFP). 

Transection and Luciferase Assays-YIAA ceUs were used at a confluency of 40-60%. Cells 
were transfected with 160 ng of reporter plasmid (pGL3; Promega) containing the promoter 
sequence with zinc finger-binding sites and 40 ng of effector plasmid (zinc finger-effector 
domain fiisions in pcDNA3) in 24-well plates. Cell extracts were prepared 48 h after 
transfection and measured with luciferase assay reagent (Promega) in a MicroLumat LB96P 
luminometer (EG & Berthold, Gaithersburg, MD). 

Retroviral Gene Targeting and Flow Cytometric^ks primary antibody an ErbB-1 -specific 
mAb EGFR (Santa Cruz Biotechnology), ErbB-2-specific mAb FSP77 (gift from Nancy E. 
Hynes), and an ErbB-3-specific mAb SGPl (Oncogene Research Products) were used. 
Fluorescently labeled donkey F(ab9)2 anti-mouse IgG was used as secondary antibody 
(Jackson ImmunoResearch). 



wo 03/104414 



-25- 



PCTAJS03/17946 



Eacterial extracts of vMal-fijsion proteins fn r KTJSA assays 

The selected zinc finger proteins were cloned into the pMal vector (New England 
Biolabs) for expression. The constructs were transferred into the E. coli strain XLl-Blue by 
electroporation and streaked on IB plates containing 503g/ml caibenecUlin. Four single 
colonies of each mutant were inoculated into 3 ml of SB media containing 50 3g/ml 
caibenecUlin and 1% glycose. Cultures were grown overnight at 37°C. 1 .2 ml of the cultures 
were transformed into 20 ml of fresh SB media containing 50 3g/ml carbeneciUin. 0.2% 
glycose, 90 3g/ml ZnCl^ and grown at 37°C for another 2 hours. IPTG was added to a final 
concentration of 0.3 mM. Incubation was continued for 2 hours. The cultures were 
centrifuged at 4°C for 5 minutes at 3500 ipm in a Beckman GPR centrifuge. Bacterial pellets 
were resuspended in 1.2 ml of Zinc Buffer A containing 5 mM fresh DTT.. Protein extracts 
were isolated by fi^ze/thaw procedure using dry ice/ethanol and warm water. This 
procedure was repeated 6 times. Samples were centrifuged at 4°C for 5 minutes in an 
Eppendorf centrifiige. The supernatant was transferred to a clean 1.5 ml centrifiige tube and 
used for the ELISA assays. 

ELISA assays - Finger-2 variants of C7.GAT were subcloned into bacterial expression vector 
as fixsion with maltose-binding protein (MB?) and proteins were expressed by induction with 
1 mM IPTG (proteins (p) are given the name of the finger-2 subsite against which they were 
selected). Proteins were tested by enzyme-linked immunosorbent assay (ELISA) against each 
of the 16 finger-2 subsites of the type 5'-GAT CNN GCG-3' to investigate their DNA- 
binding specificity. 

In addition, the 5'-nucleotide recognition was analyzed by exposing zinc finger 
proteins to the specific target oHgonucleotide and three subsites which differed only in the 5'- 
nucleotide of the middle triplet. For example, pCAA was tested on 5'-AAA-3', 5'-CAA-3'. 
5'-GAA-3', and 5'-TAA-3' subsites. Many of the tested 3-fmger proteins showed exquisite 
DNA-binding specificity for the finger-2 subsite against they were selected. 
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1 . A non-naturally occurrmg transcription factor polypeptide comprising a 
plurality of DNA binding domains operatively linked with a flexible peptide linker having 
jfrom 5 to 50 amino acid residue sequences. 

2. The polypeptide of claim 1 that contains 2 or 3 DNA binding domains. 

3. The polypeptide of claim 1 wherein the linker has &om 5 to 30 amino acid 
residues. 

4. The polypeptide of claim 1 wherein the linker has from 5 to 15 amino acid 
residue sequences. 

5. The polypeptide of claim 1 further comprising a transcription regulating factor. 

6. The polypeptide of claim 5 wherein the transcription regulating factor 
represses transcription. 

7. The polypeptide of claim 5 wherein the transcription regulating factor 
activates transcription. 

8. The polypeptide of claim 5 wherein the transcription regulating factor is 
located at the N-terminus of the polypeptide. 

9. The polypeptide of claim 5 wherein the transcription regulating factor is 
located at the C-terminus of the polypeptide. 

1 0. The polypeptide of claim 1 further comprising a plurahty of transcription 
regulating factors. 
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1 1 . The polypeptide of claim 1 wherein each DNA binding domain contains from 
3 to 6 zinc jBnger DNA binding peptides. 

12. The polypeptide of claim 1 1 wherein each DNA binding domain contains 6 
5 zinc finger DNA binding peptides. 

13. The polypeptide of claim 12 further comprising one or more transcription 
regulating domains. 

10 14. A polynucleotide that encodes the polypeptide of claim 1. 

15. An expression vector that contains the polynucleotide of claim 14. 

16. A cell transfected with the polynucleotide of claim 14. 

15 

17. A process of altering expression of a nucleotide sequence containing a binding 
motif, comprising the step of contacting the binding motif with an effective 
amount of a polypeptide of claim 1 that binds to the motif. 

20 18. A process of simultaneously altering expression of a first nucleotide sequence 

containing a first binding motif and a second nucleotide sequence that contains 
a second binding motif, the process comprising the step of contacting the 
binding motifs with an effective amoimt of a polypeptide of claim 1 that binds 
to both the first and second binding motif. 
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